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Abstract : Risk control and optimal diversification constitute a major focus in the 
finance and insurance industries as well as, more or less consciously, in our everyday 
life. We present a discussion of the characterization of risks and of the optimization of 
portfolios that starts from a simple illustrative model and ends by a general functional 
integral formulation. A major theme is that risk, usually thought one-dimensional in 
the conventional mean-variance approach, has to be addressed by the full distribution 
of losses. Furthermore, the time-horizon of the investment is shown to play a major 
role. We show the importance of accounting for large fluctuations and use the theory 
of Cramer for large deviations in this context. We first treat a simple model with 
a single risky asset that examplifies the distinction between the average return and 
the typical return, the role of large deviations in multiplicative processes, and the 
different optimal strategies for the investors depending on their size. We then analyze 
the case of assets whose price variations are distributed according to exponential 
laws, a situation that is found to describe reasonably well daily price variations. 
Several portfolio optimization strategies are presented that aim at controlling large 
risks. We end by extending the standard mean-variance portfolio optimization theory, 
first within the quasi-Gaussian approximation and then using a general formulation 
for non-Gaussian correlated assets in terms of the formalism of functional integrals 
developed in the field theory of critical phenomena. 
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1 Introduction 



Finance and insurance are all about risk, and risk is usually thought to be all encap- 
sulated by variances and covariances. In practice however, variances and covariances 
are quite unstable with the existence of intermittent bursts of high volatilities. More 
generally, risk is embedded in the distribution of asset returns and not solely in vari- 
ance. In finance and in fact in many of our actions, we have to ponder the choice 
between risky alternatives with different rewards. How to make the best choice(s)? 

"Don't put all your eggs in one basket" is a familiar adage. The idea is then to 
try to minimize somehow the risk by diversification, a procedure that is exemplified 
by the problem of portfolio selection and optimization in finance : how to choose a 
basket of assets which maximizes your profit and minimizes your risk? 

While these considerations seem quite straightforward and reflect the common 
sense, putting them in practice provides some surprise. Indeed, the very problem of 
the quantification of the notion of risk does not find a simple general unambiguous 
answer. Even if the stochastic process describing the set of potential profits and losses 
is stationary and can thus be described by a well-defined probability distribution, the 
problem of ordering distributions does not have a unique answer. This means that, in 
general, there does not exist a unique universal measure of risk, a single number that 
can be attributed to a risky situation. This is in contrast to the much better known 
problem of ordering numbers. As soon as one has to deal with pairs of numbers, 
vectors, matrices, and even more so distributions, the problem of ordering is much 
harder and in many cases has not a unique answer. There is a vast litterature that 
addresses this question dealing with utility functions 0, stochastic dominance and 
so on [Q. 

In section 2, we first present a simple model of a portfolio made of one risky 
and one riskless asset, which allows us to introduce the key ideas, namely the im- 
portance of the time-horizon of the investment, the distinction between the average 
return and the typical return, the role of large deviations in multiplicative processes, 
and the different optimal strategies for the investors depending on their size. In 
section 3, we analyze the case of assets whose price variations are distributed accord- 
ing to exponential laws, a situation that is found to describe reasonably well daily 
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price variations. This parametric case is used to illustrate the portfolio optimization 

strategics that aim to control large risks. In section 4, wc summarize first the stan- 
dard mean-variance portfolio optimization theory. We then show how to extend it 
within the quasi- Gaussian approximation. Finally, we provide a general formulation 
for non-Gaussian correlated assets in terms of the formalism of functional integrals 
developed in the field theory of critical phenomena. 

2 An exactly soluble case : one risky and one risk- 
less assets 

Let us assume that the financial world is only made of a riskless asset and a risky 
asset. Without loss of generahty, the return of the riskless asset can be taken zero by 
a suitable change of frame. Time is discrete in units of time r taken to be unity. The 
return over a unit time period r of the risky asset is a random variable — oo < r < -|-oo 
without correlations, taken from a distribution P{r). An investor has a wealth equal 
to unity at t = 0. Should he invest a fraction / in the risky asset? What is the 
optimal fraction /? 

We start by treating this simple case because it is particularly useful to introduce 
the key concepts that will be useful for the general problem of the diversification 
and optimization of portfolios. The standard mean-variance approach comes about 
very naturally and this models allows us to discuss its limits and generalization, in 
particular to take into account the existence of large deviations. 

2.1 General case 

Here, we assume that the problem is "separable", i.e. / is a constant independently 
of the wealth. A more general case consists in evolving / as a function of the wealth 
of the investor, to account for the phenomenon that the degree of risk aversion is a 
function of wealth. This is usually addressed using the formalism of utility functions, 
which is mathematical device that quantifies the degree of satisfaction of an investor. 
Within this approach, the optimal fraction becomes a function f{S{t),t) of time and 
of the wealth S{t). 
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The choice / = corresponds to an investor who is absolutely opposed to taking 
any risk and keeps his entire wealth in cash (or invested at the riskless return). Other 
investors will recognize that taking some risks by investing in the risky asset might 
provide gain opportunities above the riskless return. But this has a cost, the risk, 
and the question is how to choose the optimal degree of risk/return. 

Let us evaluate the value S{t) of the portfolio comprising a fraction / of the risky 
asset after t time steps. If we note ri,r2,...,rt the specific returns observed during 
these t time steps, the wealth becomes 

Sit) = n (l - / + fe'^)= n (l + (e^^- - 1)/) . (1) 

This expression contains all the information, as long as we can enumerate all possible 
scenarios ri, r2, r^. In the absence of correlation between the returns rj, all the 
information is in the distribution P{S(t)) of the wealth. The problem thus amounts 
to determine P{S{t)) knowing the distribution P{r) of the one-step-returns of the 
risky asset. 

Consider first the situation where the returns vj are small [rj << 1 for all j), 
e'"-' — 1 ~ Tj and 1 -|- r^/ ~ e*"^-^. The expression (|1|) becomes 

5(t)^ne^^/=exp(/^r,). (2) 

i=i ^ i=i ^ 

J log S(t) is the sum of t random variables and the central limit theorem gives 

where m = (rj) and o"^ = (r|) — m^. The mean return of this portfolio is r(/) = mf 
and its variance per unit time is v{f) = cf'^p. 

In this Gaussian limit, the risk is completely captured by the single number f (/). 
For a given return r = m/, we get 

^ r- I A\ 

r = -v^ , (4) 

a 

by eliminating /. This relationship (^) sums up the standard result of the celebrated 
Markovitz portfolio theory ^, namely the more risk one takes (quantified by ^/v, 



the larger the expected return r. There is no best optimal strategy / but rather a 
continuum of possibilities offered to the investor and whose choice depends on his 
risk aversion. 

In the expression (D, S{t) behaves typically as S{t) = e'^*+''^, where ?7 IS a 
gaussian random variable with zero mean and unit variance. S{t) becomes larger 
than one with almost certainty for times larger than the characteristic time 

- |. (5) 

at which the mean return rt becomes comparable with the amplitude ^/2vt of the 
fluctuations. Thus, the decision to invest a fraction of his wealth in the risky asset 
must take into account the time interval during which he decides to immobilize his 
investment 0. If this interval is too short, the results will be dominated by the 
fluctuations and the risk is large. The investment in the risky asset becomes profitable 
only for time intervals larger than t*. The existence of this temporal factor in the 
investment strategy can be embodied in the so-called Sharpe ratio defined by the 
ratio of the expected return over the standard deviation 

r 

Sharpei = —= . (6) 

Sharpei can also be expressed as Sharpei = \/2[t*{f)]~^^'^. The Sharpe ratio depends 
on the time scale. A good investment strives to decrease the time scale t*{f) beyond 
which the return is not anymore sensitive to fluctuations, hence to increase the Sharpe 
ratio. 

2.2 An illustrative case 

Before addressing the question of large risks, let us make concrete the previous results 
by specifying the distribution P{rj) of the risky asset. We consider a binomial case 
which has the advantage of allowing for an exact treatment. The return of the risky 
asset is thus assumed to be either 

e''^ = A > with the probability p, (7) 

or 

e^^ = with the probability 1 — p. (8) 
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This particular case can be interpreted as a casino game and has first been studied 
by Kelly 0, in relation with the theory of information which allows one to derive 
naturally the mean-variance approach in the Gaussian world. 

A unit currency, from which a fraction / is invested in the risky asset, becomes 
on average after one time step : 

p[l + (A - 1)/] + (1 - - /] = 1 + (pA - 1)/ . (9) 

It is clear that only the case pA — 1 > is interesting and lead to investment strategies 
with / 7^ 0. The average return per time step is log[l + {p\ — 1)/] while the average 
gain is {p\ — l)f. 

In this model, the scenario ri,r2, ...,rt reduces to the knowledge of the number j 
of times where a gain was realized. The case (H) for a loss then occurred t — j 
times. The expression (P becomes 

^,(t) = (i + (A-i)/)'(i-/r^'- (10) 

The probability that this wealth is realized is simply the binomial law 

vij) = Qp^{i-py-' . (11) 

The problem of the optimizing the investment is completely captured by the two 

formulas ([lO| ) and (|Tll). In the limit of large investment times, the Stirling formula 

leads to the following expression 

. X 1 a-pt)^ , , 

V{j) ~ =e 2tp{i-p) ^ (12) 

'27rtp{l-p) 



valid close to its maximum. This expression (^2|) is nothing but the translation of 
the central limit theorem applied to the Binomial law, where the sum is of the type 
J2i=i with Xi = 1 with probability p and Xi = probability 1 — p. Expressing j as 
a function of Sj(t) in (|TUp, 



log^ 



(1-/)' 



logi±(^ ' 



(13) 



1-/ 

we obtain the (log-normal) distribution of portfolio wealths at time t 



(log 



S{t) 



nSit))c^ ; expf- ^ ) , (14) 



where 

= (l + (A - l)fj\l - (15) 

is the most probable value of S{t), i.e. the wealth that maximize the probability. It 
simply corresponds to the most probable value of j equal to pt. The variance f (/) is 
given by 

v{f) = p{l - p) (log lliizilZy . (16) 

In this Gaussian limit, the portfolio selection problem is completely embodied by the 
two variables: the average return r(/) defined by r(/) = Ihi^eIII^ i.e. Spp{t) = e''*, 
giving 

r(/) = plog(l + (A - 1)/) + {l-p) log(l - /), (17) 

and the variance f (/). Eliminating / allows us to obtain the mean- variance repre- 
sentation r{v) which has become standard in portfolio theory. 

Figure 1 shows r and f as a function of / for p = 1/2 and A = 2.1. These numerical 
values give an average gain (|) pA — 1 of 5%. Figure 2 gives r as a function of v for 
the same values. For / small, r ~ {pX — l)f and v ~ p{l — p)A^/^, which retrieves the 
dependence r oc ^/v found in the preceeding section. Figure 3 shows as a function of 
/ for r > the behavior of t*{f) defined by (§) and of Sharps = V2[t*{f)]-^/^ = ^ 
defined for a unit time step. It is profitable to invest in the risky asset only if one is 
ready to freeze the investment over a period larger than t*{f). This time dimension is 
often forgotten or dismissed in the more standard approach that does not distinguish 
between the most probable and the mean gain. If the investment period t is fixed, 
the best portfolios correspond to those with t* (/) < t. This introduces the constraint 

We observe that, for pA > 1, r exhibits a maximum 

Tmax = plog Ap + (1 - p) log _ ^ (18) 

reached for an optimal fraction /* invested in the risky asset which is equal to 

/• = (19) 

For p = 1/2 and A = 2.1, /* = 4.54%, r„„^ = 0.113% and v{f*) = 0.00227. The 
volatility y/v, which gives the order of magnitude of the fluctuations of r, is equal to 
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4.76%, i.e. r (/*) ~ 3500r and Sharpei{f*) = 0.024. For p = 1/2 and A = 2.5, r (/*) 
becomes smaller than 200r, corresponding to Sharpei = 0.07, with an optimum 
return better than 2%. For p = 0.9 and A = 1.2, t*{f*) is also of the order of 200r, 
corresponding to Sharpei = 0.07, with a similar optimum return 2%. As pX becomes 
much larger than 1, /* increases as it becomes more and more interesting to invest 
in the risky asset. For p = 1/2 and A = 5, r^ax = 2.5%, /* ~ 0.4, ^ 20 and 

Sharpei = 0.25. For A — oo and in the regime of positive mean returns, Sharpei 
becomes independent of / and of A: Sharpei \|'^p^ becomes completely 
controlled by the stochastic part of the process. 

The existence of a maximum ([1^) was not predicted by the linear analysis of the 
previous section. It stems, as we now analyze, from the existence of large deviations 
inherent in the multiplicative processes P,pi]|). 

2.3 Large risks and large deviations 
2.3.1 Typical versus average returns 

It is a well-known fact that average and typical values can be drastically different in 
multiplicative processes. This difference is at the origin of the existence of a maximum 

Let us come back to the exact expressions ([TUp and (|n|). The mathematical 
expectation {S{t)) of the value S{t) of the portfolio corresponds to taking the average 
over a large number of realizations, in other words over a large number of investors. 
This will be a useful quantity for a bank, say, as opposed to a single investor. It is 
given by 

{S{t)) = j:V{j)S,{t). (20) 

j=0 

This expression can be summed and yields 

(Sit)) = (p[l + (A - 1)/] + (1 - p)[l - /]) = (ipX - l)f + l)\ (21) 
We thus simply retrieve the expression (|) taken to the power t, i.e. 

(1 + (gain over one time step ))* . (22) 
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{S{t)) is not the same as the most probable value Spp(t) given by ([15|) . {S(t)) and 
Spp(t) have very different behaviors. Figure 4 gives the dependence of and 
5*^^(1) as a function of / for the case p = 1/2 and A = 2.1 studied previously. As 
can be seen from (|21|), (5'(1)) is a linear function of / while Spp{l) exhibits a highly 
nonlinear dependence. Technically, the difference can be tracked back to the fact that 
Spp(t) is the exponential of the average of the logarithm of S{t), i.e. the geometrical 
mean of S(t) : 

Spp{t) = e^"^^'^'^\ (23) 

which can checked by calculating (log5'(t)) = Yfj=o'P{j)^og Sj(t). This is to be 
compared to 

{S{t)) = (e'°s^W), (24) 

which is the arithmetic average. For / small for which the return is close to zero, 
the two values become indistinguishable, as can be seen in figure 4. / << 1 implies 
small fluctuations of wealth and the two averages become identical in this limit. 

In contrast for arbitrary /, {log S{t)) and log(S'(t)) can be very different. Math- 
ematically, the exchange between the mean and the exponentiation is not justified. 
Expanding the exponential in (^4]) yields 

(Sit)) = 1 + (log5(t)) + ^((log5(t))2) + ... , (25) 

that can be compared with 

Sppit) = e<'°s^W> = 1 + (log^(t)) + i(log5(t))2 + ... . (26) 

The difference between the two quantities occurs at the second order in the expansion. 
More generally, for any positive random variable, {x'') > {x)^ and thus {S{t)) > 
Spp{t). As an illustration, figure 4 shows that (5'(1)) ~ 1.002 while Spp{l) ~ 1.001 at 
the maximum of the mean return. After 1000 time steps, this leads to (S'(IOOO)) ^ 7.4 
while S'pp(lOOO) ^ 2.7. The difference explodes exponentially at large times. Notice 
that, for / > 0.09, r becomes negative and thus Spp(t) decreases exponentially while 
{S{t)) still increases exponentially: a typical investor will loose almost surely while 
very rare investors will exhibit shameless gains of such a magnitude as to make the 
average economy still growing! 
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The origin of these differences can be found in the large fluctuations of multi- 
plicative process, exhibiting rare fluctuations but of sufficient magnitude to control 
the behavior of the mean gain. For instance, a sequence of positive returns such that 
there are j = t successes occurs with an exponentially small probability. However, 
its contribution to the mean gain has an exponentially large magnitude. As a con- 
sequence, this type of scenarios brings in an important contribution to {S{t)) and a 
dominant contribution of higher moments. 

The difference between (|23|) and illustrates dramatically the importance of 
diversification. The result (p^) quantified the results obtained by an investor A who 
could divide his initial wealth in independent assets, each asset corresponding to 
the couple (/ invested in the risky game, 1 — / kept at the riskless return), each 
playing a given game independently of the others. This diversification is gratifying 
as his total wealth is controlled by the mean gain, this becoming true in the limit 
of large A^. This is very different from the result (^) obtained by an investor B 
who has only a single asset and is thus completely controlled by a single scenario. 
In practice, for the wealth to be described by p^), the number of assets A^ must be 
exponentially large in t |^ : N oc e'^^, where c > is an increasing function of /. At 
short times, it is not very useful to diversify, but as time passes by, the diversification 
becomes essential. This introduces an additional level of complexity : for practical 
implementation, A^ can not be arbitrarily large ; one can thus imagine investment 
strategies which modify / such that the number N{t) necessary to track the mean 
gain remains bounded. 

These effects are well-known in the physics of disordered media and are at the 
origin of a wealth of effects, culminating in the non-ergodic behavior of spin-glasses. 
The consequence of large fluctuations in multiplicative processes in the context of 
portfolio selection has also been stressed recently 0. 

2.3.2 Large deviations 

Large risks are characterized by the existence of deviations from the behavior de- 
scribed by the central limit theorem. 

Indeed, the previous considerations have relied on the fact that the fluctuations of 
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the return could be adequately quantified by a unique number, namely the variance. 
For those fiuctuations of S{t) away from the most probable value Spp{t) given by ([T5| ) 
that are larger than the standard deviation of the Gaussian approximation (|1^, the 
average return and the variance are no more enough to quantify the whole spectrum 
of risks. This problem is very important because, as already pointed out, there is not 
a unique optimal investment strategy but rather a set of strategies each correspond- 
ing to a given risk aversion. Since risk aversion relies on a subtle combination of 
psychological and financial considerations, it is very important to provide a reliable 
appreciation of all the possible dimensions of the risks. 

Technically, an answer is provided by the theory of large deviations, which es- 
tablishes the probability, for large t, for the mean of independent and identically 
distributed random variables : 

1 * 

Prob[- J2 - a;] ~ e*'(^) , (27) 



where s{x) is the Cramer function |TT|, |T2[. When applied to the binomial law 
( |Tl| ) where the sum is of the type Yli=i with Xi = 1 with probability p and Xi = 
with probability 1 — p, this result is nothing but the one obtained from the Stirling 
approximation of : 

p(j =a;t) = e*^("), (28) 

with 

s(a;) = xlogp+(l— a;) log(l— p)— a;loga:; — (1 — x) log(l — x) , for < x < 1 (29) 

s{x) = — oo otherwise . (30) 

Notice that s{x) reduces to s{x) = — 2p{i-p) neighborhood of its maximum 

X = p, which retrieved exactly the expression ([1^) used until now. Figure 5 shows 
s{x) and its parabolic approximation for p = 0.5 and p = 0.95. In the first case 
p = 0.5, the parabolic approximation, leading to the Gaussian distribution and to all 
the previously quoted results, overestimates the risks. For instance, the risk to loose 
systematically at each time step corresponds to a; = 0, i.e. V{j = 0) = e*'^'''^^ where 
s(0) ~ —0.5 according to the parabolic approximation, which can be compared to 
the exact value —0.7. The probability to loose at each time step during t is thus 
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estimated as e~°'^* ~ 7 10^^ and e""'^* ~ 10"'^ respectively for t = 10, leading to an 
overestimation of this risk by a factor 7 in the Gaussian approximation. For p > 0.7, 
the effect goes in the other direction. For instance, for p = 0.95, we see in the 
figure 5b that the function s{x) is significantly above its parabolic approximation. 
As a consequence, large deviations are much more probable than predicted by the 
Gaussian approximation and the standard mean-variance theory. 
In order to quantify this result, we express ([10|) as 



using j = xt. The distribution V{x) is given by with V{x) with the expres- 
sion (0) provides the distribution of S(t) by eliminating x. We can also use these two 
formulas by using x as a parameter : for S{t) given, we determine the corresponding x 
from (^) which is then reported into (|28| ) with (p9D to derive the probability. Recall 
that X is simply the frequency of gains. Large deviations correspond to a frequency 
X different from p by an amount of order one. 

As an illustration, let us take p = 0.95 and A = 1.1, for whose realistic behaviors 
are found (5% probability to loose and a gain of 10%). Let us assume that the investor 
chooses / = 0.4 (which is close to the maximum) which corresponds to an average 
return r(/ = 0.4) = 1.2%. This result is also obtained from (|3T|) with x = p = 0.95, 
using the definition S(t) = e^*. What is the probability that, instead of getting the 
expected return, this investment looses on average —1.6%? This value corresponds to 
X = 0.9 as seen from (|3T|). The calculation of s{x = 0.9) gives —0.0264 in the parabolic 
approximation and —0.0206 with the exact expression (^). The probability of this 
bad luck is thus e~° '^^^^* in the Gaussian approximation and e~°-°^°^* according to the 
Cramer expression. For t = lOOr, we find respectively a probability equal to 7% and 
13%. Thus, the Gaussian approximation under-estimates by almost a factor of two 
the probability of such a scenario. 

Consider another example : what is the probability that the average return be 
equal to —0.5%? Following the previous steps, we find that this return corresponds 
to X ^ 0.92 for which s{x = 0.92) = —9.5 • 10~^ in the parabolic approximation and 
s{x = 0.92) = —8.1 ■ 10^^ for the exact expression (^9]). For t = lOOOr, we obtain 
7.5 ■ 10~^ and 3 ■ 10^^ respectively for the probability to get such a return. Here, the 




(31) 
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Gaussian approximation is off by a factor 40! 

To sum up the section, it is not possible in general to capture the degree of risk 
to which one is exposed to by the single variance parameter. A natural extension is 
to consider the full Cramer function as the quantity encoding the risk. The choice of 
the best investment becomes then even more dependent on additional considerations 
involving the aversion to rare and relatively large risks. 

Is it possible to choose and investment that minimize the large risks? In the 
present simple model, the answer is : choose / = is you are afraid of large risks. To 
see it, we pose the problem in the following way: we try to minimize the probability 
that the return be less than some level p, whose specific value is investor-dependent 
and not specified as long as it is sufficiently in the tail. This level p of a minimum 
tolerable return corresponds to a frequency Xp{f) by the equation 

(l_/)i-p[l + (A-i)/fp = e''. (32) 



With (0), it gives Xp as 



x.=v^^( ^'':^^)'^' )- '^^> 

where cr(/) = \Jv[f) is the standard deviation. The worst return is of course log(l — /) 
for which x is zero. Notice that the value Xp = p corresponds to the typical return 
r(/) given by (p^. For p < r{f), the frequency of gains Xp is less than p. 

The probability to observe a return less or equal to p is Jq'' P{x)dx = 
Yff=o ijjP'i^ ~ P)* which each term behaves like e^^^^^^^^\ Is there an opti- 

mal / such that Jq'' e*'^'-^''-' is minimum? Using (^), the extremum of Jq'' e^^^^^dx with 
respect to / is given by the condition = ^ = — -j^^ + i+(a-i)'/ • However, this cor- 
responds to a maximum since the second derivative is negative. The value /i which 
makes vanish ^ corresponds to a maximum of probability to gain a severa loss. 
These values are thus to be avoided. With (p3D, they are 1 — / = ^^(1 — Xp) with 
P = log A^ + (l-a;p) log(l-Xp)+XplogXp+Xplog(A-l). (1-Xp) log(l-Xp)+XplogXp 
being negative with a minimum — log 2 for Xp = 1/2 and vanishing for Xp = and 
1, there are non-trivial solutions for Xp < p. There exists other situations where an 
optimal investment can be devised such as to minimize the probability of large losses, 
as we discuss below. 
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3 Portfolios made of uncorrelated assets 



Consider the sum 

N 

Sit)=Y.pMt) , (34) 

i=l 

where Xi{t) represents the value at time t of the i-th asset and pi is its weight in 
the portfoho. By normahzation, we have Y^i^iPi = 1- S(t) is the total value of the 
portfolio made of assets at time t. 

3.1 Gaussian limit and measure of risk by the variance 

We would like to characterize the probability distribution Ps{S(t)) of S(t), knowing 
the distributions Pi{xi) of the Xi for the different assets. For uncorrelated assets, the 
general formal solution reads 

Ps{S) = J dXiPi{Xi) J dX2P2{x2)... J dXjPj{Xj)... J dxiyPN{xN)s(^S{t)-^piXi{t)^ . 

(35) 

Taking the Fourier transform Ps{k) = J^^ dSPs{S)e-'''^ of (H) gives, by the defi- 
nition of the characteristic function 11131, 



Ps{k) = P^{p^k)P2{p2k)...P,{p,k)...PM{pNk) . (36) 

This equation expresses that the Fourier transform of Ps{S) is the product of the 
Fourier transform of the distribution of each constituent asset with an argument 
proportional to their respective weight in the portfolio. We now use the cumulant 
expansion of the characteristic function : 

logP,ip,k) = -tkpjC^ixj) - ^pp^C2{x,) + 0{p'k^) , (37) 

where Ci{xj) = {xj) is the average of Xj and C2{xj) = {x'j) — {xj)^ is the variance 
of Xj. We assume for the time being that these cumulants exist and are finite. The 
notation 0{p^k^) means that the higher order terms are at least of order p^k^. In the 
limit where the number of assets is very large, the weights pi are of order (we 
avoid the special cases where one or a few assets are predominant). In this case, the 
previous cumulant expansion is warranted since all arguments are small and, keeping 
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terms up to the quadratic order, we obtain 



N I ^ 

Ps{k) ~ exp[-tkY,p,C,ix,) - -k''J2Pp2ix,)] . (38) 
i=i ^ i=i 

Its Fourier inverse is 

P^(5) ^^=exp[-^-^^^] , (39) 
where the average of S is 

N 

{S) = Y.pA^^) , (40) 
i=i 

et la variance de s'ecrit 

N 

V = Y.Pp2{xj). (41) 
i=i 

These expressions express again the vahdity of the central limit theorem. 

3.2 Large risks : the case of exponentially distributed assets 



The previous Gaussian approximation (39) does not apply for large risks. We have 



already met this fact in the binomial model with the difference between ([1^) and ( Pq , 
pg]) . We now present a slightly more general illustration of this fact by studying the 
case of assets that are exponentially distributed. It turns out that this situation is 
not far from reality when one deals with daily returns in . 



3.2.1 Determination of the distribution of portfolio values 

To simplify, we assume that the distributions of asset return is symmetric : 

PjiS^j) = ^«,e".l^--.l , (42) 



for — oo < 6xj < +00, where (Sxj) = and the variance 5xj is C2{vj) = ^ 
The Fourier transform of (33) is 



Inserting this expression in (|36|) and taking the inverse Fourier transform, we get 

1 <.+oo pikSS 
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We retrieve the Gaussian approximation by noting that 
1 



= exp -log[l + (fcp.a-^)'] ~ exp -(fcp.a-^)' , (45) 



1 + {kpjQj 
and thus 

Ps{5S) — dke'''' exp[-e Efe«7')') , (46) 
which recovers (p9|). 

The integral (Q) can be performed exactly by using Cauchy's theorem: one 
replaces the integral over the interval — oo < k < +cxd by an integral over a contour 
in the complex k plane. This contour is formed of the real axis and closes itself by 
a half-circle of infinite radius in the top half-plane. Cauchy's residue theorem then 
gives 

PsiSS) = \ E ^ ^ exp[-^] . (47) 

This result is correct only if all the values pjOj are different. The special cases where 
several values are the same do not pose particular problems and can also be explicitely 
treated. This expression (|1^) shows that the large risks are given by 

4n,.^,_(( '%y- )^-^ 

where 

a = (49) 

Pjm a X 

with jmax being the value of j which corresponds to the largest pjaj^. The order 
of magnitude of the largest fluctuations of 6S is thus a~^, i.e. provides a good 
estimation of the extreme risk of the portfolio. 

3.2.2 Diversification in the presence of extreme risks 

To protect oneself against the large risks, one should thus minimize a~^, i.e. solve 
the optimization problem on the weights pi,p2, ■■■,Pn, which consists first, for fixed 
weight pi,P2, ■■■,Pn, in finding the smallest ratio — that we note Mirij^i at — , and 
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then to determine the weights pi,P2, ■■■,Pn such that the smallest ratio be the largest 
possible. To sum up, we thus search for the solution of the max-min problem 

MaXp^^P2,,„^pj,Mini^j=i^,„N)— ■ (50) 

Pj 

To solve this problem, we invoke the following identity 

Mzn(,=i,...;v)^ = hmg^+Jj2{^)-'^) . (51) 
Pj \=i Pj ' 

The idea is to invert the limit q — >■ +00 and the maximization with respect to the 
weights. At q fixed, we then maximize this expression with respect to the weights "Pj 
together with the normalization constraint J2f=iPj = by using Lagrange multiplier 
method. The solution is (^)~'^ = constant, independently of j. Then taking the 
limit q — > +00, the only possible solution is pj oc aj, which gives 

Pk = ^ , (52) 

after normalization. We will derive again this result below (see the equation 



by a different argument in terms of the minimization of the portfolio kurtosis, thus 
providing a different perspective to this result which constitutes the large deviation 
correction to the mean-variance approach for exponentially distributed assets. The 
same results have been derived independently in Ref. |1^ . 



3.2.3 Large risks and optimal portfolios 

A better measure of the risk than the sole knowledge of a is provided by the probabil- 
ity that the loss 6S of the portfolio be larger than some threshold A. The parameter 
A is a priori arbitrary and is chosen by the investor in view of his own risk aversion. A 
is a so-called VaR, or value-at-risk. It is the value that can be lost at the probability 
level J:^Ps{5S)dv: 



/ Ps{SS)d6S = -j: ^j^, ^exp[ 



-ij- (53) 
Pj(^j 

This expression provides the probability to observe a loss larger than A during the 
unit time step considered here. Reciprocally, we can choose the confidence interval, 
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say 95% and determine what is the threshold A such that the maximum loss remains 
smaller than A in 95% out of all scenarios. The VaR A is solution of 

PsiSS)d5S = 0.05 , (54) 

J—oo 



which can be explicitely solved using 

Let us now determine the weights of the portfolio that minimize the probability 
of loss larger than A. The expression (|53|) is not very convenient for an analytical 
calculation. We thus turn to a more robust measure of the VaR given by the fact 
that a smooth estimation of JZ^ Ps{SS)d6S is provided by Ps{6S){l — e^)d6S. 
Indeed, 1 — is close to 1 for 6S < —A and is negligible in the other case. (1 — e^) 
plays a role similar to a utility function, quantifying the sensitivity of the investor to 
large fluctuations. This leads to the following smooth definition of the Var A : 



TLp=,[Ps{\6S\))-TL^^^(Ps{\6S\)j=l~p, (55) 

where TLp(^f{x)^= e~^^ f{x)dx is the Laplace transform of the function /(x). 

The first term TL^=o ^-Psd^'S'l)^ is nothing but the total probability for a loss to 
occur (irrespective of its amplitude). This expression ( |S^ has two advantages: i) a 
more progressive interpolation of the losses to determine the VaR and ii) the use of 
the Laplace transform which can be directly estimated in the case of portfolios from 
the product of individual Laplace transforms of the distribution of each asset. 

Let us change variable and write pj = p'j, where the pj are the novel parameters 
over which to minimize. This change of variables ensures that the weights remain pos- 
itive. Notice that we could relax this constraint and allow for negative weights which 
would correspond to so-called "short" positions. We thus would like to minimize 

TL^=o {Ps{\SS\)^ -TL^^^. {Psi\SS\)^ -7 E (56) 

with respect to the pi, p2, Pn- 7 is a Lagrange parameter that ensures the nor- 
malization of the weights. Using the analytic expression of the Laplace transforms. 



we can write (pq) as 

N 

1-p(A)-7Ep^ (57) 
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where 

pW - r,^^t?^ 2, ■ (58) 
Putting to zero the derivative of (0) with respect to each gives 

1 A ^ 

= iv + iv ~ ■ ^^^^ 

This solution exists provided that A is not too large such that the p^'s remain positive. 
7 is eliminated by the normalization. 
For large A, 

1-KA)^E-' (60) 
which recovers the situation treated in section 3.2.2. 

3.3 Beyond the Gaussian limit : cumulant expansion and 
large deviation theory 

For arbitrary assets with finite variance, the VaR at the probability level p is given 
by 

/ PsiSS)d6S = 1 - p . (61) 
The large deviation theorem allows us to get 

PsiSS) ~ e^^^-^^) , (62) 

where the Cramer function s{x) can be expressed in terms of all the distributions 



PjiSxj) for J = 1 to iV |T0 



/ 1 ^ \ 
s{6S) = Inf^(^-Y.^ogP,{p,P)+P6Sj . (63) 

Inff^ expresses the fact that we evaluate the term within the parenthesis for the value 
of (3 which minimizes it. This expression ( |5B| ) together with ( |5^ gives the distribution 
of deviations that can take arbitrarily large values, i.e. much beyond the Gaussian 
approximation . 

This expression also contains the Gaussian limit valid for small fiuctuations. This 
can be seen from the formula { ^7\j adapted to the Laplace transform (by replacing ik 
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by j3). Noting {5S) = j:f=iPjCiiSxj) and V = Y.f=iP]C2{5xj) (equations (0, 0)), 
the expression (^3D becomes 

Ns{5S) = Inff,{-l3{5S) + ^Vp^ + p5S) . (64) 

s{6S) can then be obtained as the solution of a simple quadratic minimization. The 
value of f3 that minimizes the expression within the parenthesis is 

We thus finally obtain 

iss-{ss)Y 

s{SS) = ^^^^^ . (66) 



Reporting in (p^D, we thus retrieve the Gaussian approximation. 

Let us now use ( |63|) to express the first leading corrections to the Gaussian ap- 
proximation (|39D . In this goal, we expand log Pj{pjP) on the cumulants c^: 

logP,(/5) = E^Mr • (67) 

n=l 



Interchanging the sums over the assets j and over the cumulants n, we write (|63D as 
Ns{6S) = Inffs[^3{S - {S)) + \vp' - ^C,f3' + ^C,f3' + , (68) 
where {S) and V are given by (^) and (|4T|), 



and 



N 

Cs = T.pH^ (69) 



N 



If the distributions of the price variations of the assets are non-symmetric, then 
6*3 7^ and the first correction to the Gaussian approximation reads 

For symmetric distributions such that (4 = 0, the leading correction is proportional 
to the kurtosis k = ^ : 

P.,5).exp|-M!(l-5^M1Z(^)]. p2) 
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For a typical fluctuation S — (S) ~ yV, the relative size of the correction is order 
= ff- Notice the negative sign of the correction proportional to C4 which 
means that large deviations are more probables than extrapolated by the Gaussian 
approximation. 

Let us illustrate these results for the exponential distributions. Consider the 
simple case where all assets have the same distribution, i.e. aj = a for all j's. Let 
us also take all weights pj equal to The expression (|63D then yields 

s{6S) = log{a6S) + 1 - a6S. (73) 

We recover directly this result by using the exact relation (|4^). Indeed, the family of 



Gamma functions is close under convoluation (||T6[, p. 47). Applying this result to 
the variable 63=-^ Y^jLi ^^j, this explains the additional factor in the exponential 
and the prefactor. We thus have 

PsiSS) = ^^iNa6Sf-'e-^'^''. (74) 



The expression (^) reported in (|6^ ) yields indeed (|7^). This result shows that the 
extreme tail of the distribution remains essentially of the exponentialform e"^"'''^, in 
agreement with our previous results. 

The figure 6 illustrates these results by showing the function s{y = a6S) = 
\og{y) + 1 — y, in comparison to its parabohc approximation Sg{y) = —^{y — 1)^. It 
is clear that the parabolic approximation is correct only for small deviations around 
the mean value y = I. For \y\ = 3, Sg{y) is already twice as large as s{y), leading to a 
severe under-estimation of large fluctuations by the Gaussian approximation. Notice 
also that for large y, the function s{y) becomes essentially parallel to the straight line 
of slope —1, thus justifying the asymptotic shape (^) of the distribution tail. 

4 General analysis of portfolios 

Let us consider assets X°', a = 1, ..,N, each of which has an instantaneous value 
xl at time k. We now present the analysis of portfolio optimization in the presence of 
large fluctuations and for the general case of correlated assets. Before reaching this 
goal, we first recall the standard mean-variance approach valid within the Gaussian 
approximation. 
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4.1 Correlation matrix between assets 



Within the Gaussian approximation, correlations between assets can be fully charac- 
terized by the determination of the correlation matrix 

Vab = {dx'^dx') , (75) 

which also reads V = (X^X*^^) in matrix notation. X" is the column vector with row 
element x°', for a = 1 to X and the exponent stands for the transpose operation. 
By construction, Vat is symmetric and for non-singular cases can be diagonalized 
with all its eigenvalues being real. The elements of the eigen-vectors Pa are 
also real [0. Notice that Vab is a symmetric matrix with only X and not X(X + 
l)/2 independent degrees of freedom. There is thus an orthogonal matrix A such 
that A^VA = X'^, where A'' is the diagonal matrix made with the eigenvalues of 
V. Recall that A is orthogonal if A~^ = , i.e. J2c^acAbc = Sab- Then the vector 
f/" = A^X" has the following correlation matrix (A^X"(A^X")^) = A^(X"X''^)A = 
A^VA = A*^, i.e. 11°' has all its components that are uncorrelated with each other. 
The quadratic average ((f/")^) of an element of this vector is then equal to the 
corresponding eigenvalue A^. We can thus decompose the assets over the set of 
independent factors U°', in matrix form = AUa and explicitely 

N 

xl = Y.Aa,u\. (76) 

b=l 

Since the U"' are independent and of variance A, the correlation matrix (X"X'*^) is 
equal to AX'^A^ , i.e. 

{Sx^Sx'') = {AX''A')ab = E AcAc^c (77) 

c 

This decomposition thus reduces the problem of correlated assets to the previous case 
of uncorrelated assets. Within the Gaussian approximation, the correlation matrix 
V determines completely the distribution of price variations through 

PiX") oc exp (^-^X"^\/-iX'^^ . (78) 

We can verify (^) directly from ( [78D by making explicit the calculation of the corre- 
lation with the probability ([78|) . 
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4.2 Optimal portfolio within the mean-variance approach 

Consider a portfolio made of assets with respective weight pa, with a = 1 to A^. 
The variation 6S{t) of the value of the portfolio during the time interval r is 

N 

5S{t) = J2pJx''{t) (79) 

a=l 

(= p^X"- in matrix notation). The variance {[5S{t)Y) is 

{[5S{t)f)=p^Vp. (80) 



Notice that we can retrieve this result from ( ffSD by using the fact that the distribu- 
tion of 5S{t) is formally P{5S{t)) oc ^ dX^P{X'')5{S - Ea=iPJx''{t)). Its Fourier 
transform can be expressed under the form 



n( J dxi) e^^y--XiAT^^Xj + yiXiJ = ^ ^^^^^ ^ exp(y^A,jt/j) , (81) 

which by inverse Fourier transform gives 

P(55(t))ocexp(-|M£). (82) 

It is useful to express {[5S{t)\'^) = p^Vp under a form that exploits the decompo- 
sition over the independent components U"'. Reporting X"- = AU°' in 6S{t) = p'^X"-, 
we obtain 6S{t) = p^U°', where 

p = A^p . (83) 

This expression represents the portfolio as made of a set of effective assets U"', a = 1 
to which are uncorrelated, with effective weights p". We can thus directly use the 
previous results as soon as we specify the ?7"'s. We have 

N 

{[6sm=p^x'p = Y.^am' ■ (84) 



a=l 



The average return is given by 



N 



{6S{t)) = Y.Va{5x\t)) =p^{X^)=p^{U^). (85) 



a=l 
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Within the Gaussian approximation, the risk associated with the portfoho defined 
in terms of the weights is quantified by the variance {[6S(t)]'^) — {6S(t)y given 
from (1^) by replacing V by the covariance matrix, i.e. by substracting {Sx"") {6x^) to 
{6x°'6x^)). In the sequel, we use the same notation V for the covariance matrix (and 
the other derived ones). 

The mean-variance approach developed by Markovitz, that we already visited 
in the first sections, consists in looking for the weights p"" such that the return be 
the largest possible for a given variance {[SS{t)]^) - {SS{t)f or equivalently that the 
variance ([^^(t)]^) — {6S{t))'^ be minimum for a given return. The solution is obtained 
by minimizing the following expression with respect to the weights p"" 

N N N 

j=i j=i j=i 

where ai^2 are the Lagrange parameters introduced to constraint the minimization at 
fixed gain {6S{t)) and to ensure the normalization of the weights. 
The extremalization with respect to a weight gives 

I^^Eff -i:(2A.P'-(«,t,.)-«,(.%). (87) 
which gives the following matrix equation 

A'^p = ai(f/«) + a2l , (88) 

where 1 is the single column vecteur with unit elements. With (^3]), (f/") = A'^{X°-) 
and V = AX'^A^, we finally obtain 

p'' = aiV-,'{X') + a2Aatl' . (89) 

Imposing Y^iLiP^ = 1 and J2iLiPi{X^) = {^S{t)) yields ai and 0^2 as functions of 
{6S{t)). Varying {6S{t)) enables us to derive the mean- variance curve, called the 
"efficient frontier" . ai can be interpreted as the risk aversion parameter. Numerous 
books discuss these solutions [[l^] . 

Nothwithstanding a wide application due to its convenience and simplicity, the 
mean-variance approach hides several severe problems. 
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• Mean-variance portfolios are not very diversified and have the tendency to select 
assets with comparable risks. 

• A frequent reallocation is called for to address the non-stationarity of the esti- 
mation of the covariance matrix. 

We now expose several successive generalizations of the mean-variance approach that 
specifically address the limitation of the Gaussian approximation. 

4.3 Quasi- Gaussian parametrization 

It may be useful to attempt to represent the multivariate distribution of price varia- 
tions of N assets by the following expression 

P{X) = f(^{X- XYV-\X - . (90) 

X° is the unit column vector of the average of the price variations. The function F is 
kept a priori arbitrary. Notice that if F is an exponential, we retrieve the Gaussian 
distribution (|78|) and V becomes the covariance matrix. Consider a portfolio with 
weights given by the unit column vector p. Its value variation during the unit time is 
5S{t) = Yl,a=iPa^x'^{t) = P^X using the matrix notation. The distribution P{5S)d5S 
can be written as 

P{6S) = J dXF({X - XYV-\X - X'^)y{6S - p^'X) . (91) 

To estimate this integral, we isolate one of the assets Xi = + yi and write, using 
F = X - XO, 

YTy-'Y = V{{Vi + 2{v^y)yi + y^v-'y , (92) 

where V^J^ is the element ij of the matrix V~^, y is the unit column vector 
(?/2, Z/3, yN)^ of dimension N — 1, v is the unit column vector (V^7\ ^31 ""^i ■■•) ^tvi )^ 
of dimension X — 1, and is the square matrix of dimension X — 1 by X — 1 derived 
from by removing the first row and first column. The factor 2 in 2{v^y)yi comes 
from the symmetric structure of the matrix V^^. 

We can now express the condition 6{6S — p^X) in the integral (0) : 

-5{y, --{6S- P^Xo - P^y)) , (93) 
Pi Pi 
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where V is the unit column vector (p2, Ps, ■■■,Pn)'^ of dimension A^— 1. The integration 
over the variable yi cancels out the Dirac function and we obtain the argument of 
the function F under a quadratic form in the variables S et y. Using the identity 

X^V-^X + X^Y = X^V-^X - ^Y^VY , (94) 

where X = X + VY, we obtain 

P{6S) = J dyF{y^M-'y + , (95) 

where the integral is carried out over the space of vectors y of dimension N — 1 and 

M-i = V-^ - —{v - y^V)V^ . (96) 
Pi 2pi 

We can finally write 

where jF(x) is defined by (^). The prefactor -pTyp of SS"^ is simply deduced by 
remarking that it is independent of the function / and thus equal to that obtained 
for the Gaussian case. 

This remarkable result shows that the typical amplitude of the fluctuations of 
the values of the portfolio is still controlled by the quasi-variance p^Vp defined as 
in the Gaussian case (|8^) . It is then natural to optimize the portfolio using the 
quasi-variance as the measure of the risk. This parametrization provides a natural 
generalization of the standard mean-variance Markovitz approach. There is however 
a danger for the largest fluctuations in relying only on this insight. Indeed, the 
expression (|95D for J-'{x) shows that the explicit dependence of the distribution P{6S) 
is in general a function of the asset weights constituting the portfolio, beyond the 
simple dependence in p^Vp. Minimizing only p^Vp may thus be insufficient because 
it may be linked to a dangerous deformation of J-'{x) in the tail. 

4.4 Generalization to non-gaussian correlated assets 

We assume that it is still possible to decompose the assets X" on a set of effective 
independent assets U"' : 

X" = AU" . (98) 
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The unit column vector U"' can be interpreted as the set of "explanatory" factors 
of the price variations. For this decomposition to be useful, the matrix A should 
be constant while X" and f/" fluctuate in time. This provides a set of orthogonal 
assets whose fluctuations are uncorrelated. An estimation of the matrix A can be 
obtained from the covariance matrix as above. A first generalization consists in 
relaxing the condition that the U"' be distributed according to a Gaussian distribution 
as in the previous section. From the data X"" and the construction of the matrix A, 
we can study each effective asset U"" and determine its distribution P^{U°'). If the 
decomposition works and the effective assets are not correlated, we can write for 
each effective asset U°' the cumulant expansion of the Laplace transform of {U"") 



under the form (|67|). Since the variation of the portfolio value is given as before by 
5S{t) = p^X"" = p^U"', we obtain the Cramer function of the distribution of 6S(t) 
under the form (^) with ( pP| , |7DD where the pj^s are replaced by fP = A^p according 



to the equation (^3|). The cumulants corresponds to the effective assets U"". We thus 
obtain the distribution of 5S{t) which accounts for the first leptokurtic corrections 
given by ([71|) for the non-symmetric case and by (^) for the symmetric case. 

Let us consider the symmetric case (0). This expression shows that the risk is 
not uniquely represented by the variance V but also by the coefficient C4 as well as 
by all higher order cumulants. There is a danger in working only on the variance 
because minimizing only the variance may lead to larger fluctuations than before 
the minimization! To see this, we note that typical fluctuations S — (S) ~ V have 
a probability becoming much larger than the Gaussian estimate because, if V be- 
comes small by the action of the naive mean-variance optimization, when C4 does 
not decrease in proportion, the correction term ^ remains large. At this order in 
the perturbative expansion, it is necessary to take into account non only V but also 
C4. The minimization of V must be performed by controlling and even maximizing 
simultaneously the ratio this at fixed return. The second condition ensures that 
the weight of the non-Gaussian tail remains small, thus mastering the large risks. 

This optimization problem has no unique solution due to the existence of several 
contradictory constraints. In standard economic theory, one relies on the use of utility 
function to decide the relative importance of the different constraints. For the sake 
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of illustration, let us propose the following strategies. 

• Minimize V while maximizing ^ can be obtained by minimizing the ratio 

V/i^f = V'-'^C! with 0<P<1, (99) 

(_/4 / 

to ensure that both V and C4 are simultaneously decreased. /3 quantifies the 
relative weight given to the kurtosis. 

• One can introduce a parameter which quantifies the sensitivity of the investor 
with respect to large fluctuations measured by C4 and minimize the function 

V-ij^-a,R-a2j2pj , (100) 

where ai ensures a fixed return and 02 accounts for the normalization of the 
asset weights in the portfolio. 

4.5 Exponential distributions 

We have already analyzed the problem of a portfolio constituted of assets with ex- 
ponentially distributed price variations. Let us reexamine this problem within the 
framework of the cumulant expansion. We work directly on the effective explanatory 
assets U"" that are non-correlated. 

The distribution Ps{SS) is given by (^4]) with the pj's replaced by pj (in the 
sequel, we omit the hat). The coefficients aj are the exponents of the exponential 
distributions of price variations of the independent assets. An expansion in power of 
k"^ as in (|^) but up to the order gives 

1 / ^ k'^ ^ \ 

PsiSS) = - rfte^'^^^exp -A:2 5:(p,a-T + -^(p,a7Y + --- • (101) 

Ps{5S) is thus of the form (|Tl|) with V = 2 Ef=i(Pia7^)^ and C4 = Ef=i{Pjajy- 
Let us assume that all assets have the same return, fixed to zero without loss of 
generality. We thus focus our analysis to the minimization of the risk. 

The first scenario consists according to the Gaussian approach to minimizing the 
variance ^^J^^lf^ \^ . The solution is 

(E,=iPi)' 



P^^ = • (102) 
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The second scenario consists in focusing on the large risks, here quantified by the 
ratio which describes the large fiuctuations of the price variations. The ratio ^ 



can be written as with Wi = (pjaj^)"^. It is maximum when all the weights 

Wi are equal, i.e. 



= ^ . (103) 

This is the result (|5^ ) already obtained from the condition of minimizing the proba- 
bility of large losses. 

Notice that the two strategies (|102|) and (|103|) become identical pk = if all 



assets have the same parameter a. Indeed, the large risks are diversified from the 
beginning since the probability tails are identical in this case. The strategy ( [L03| ) 
corresponds to balance the risks equally over all assets. In contrast, the portfolio 
( |102| ) over-controls the balance : if an asset exhibits a larger risk in the tail, i.e. its 
coefficient a is significantly smaller than the others, it is almost completely absent 
from the portfolio. If an asset has a large a, it will have a large weight in the 
portfolio and its parameter ^ becomes smaller than the other ones. We see here 
the general trend exhibited by solutions obtained from the mean-variance approach 
which correspond to a diversification only on assets presenting comparable risks, while 
excluding almost completely the more risky assets. 

The strategy ( |103| ), which implies that p^^^a'f^^ = J2jLiC(j is a constant indepen- 
dent of k, leads to a simple exact expression of the distribution of the fiuctuations 



6S of the portfolio value. Indeed, the formula (0) gives in a manner similar to (JTJ) : 



i=i ' ^ i=i 

This Gamma distribution converges in its center to the Gaussian law while keeping 
an exponential tail for the largest fiuctuations, with an exponent Y^jLiOij- This can 
be compared to the distribution ( ^7|) with the value of p/s given by ( |102| ) : 

^^=1 n.^,((S)^-i) ^ ^ 

On see clearly that, as expected, the large losses are better controlled by the strategy 
( |103| ) yielding typical fiuctuations of the order of 5S^'^^ ~ while the 
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strategy (|102|) leads to typical fluctuations of order SS^^^ ~ ^ , where amax is 

the largest of the exponents. Let us assume for the sake of illustration that the 
exponents a are distributed according to a Gaussian law with mean (a) and variance 
then 6S('^ ~ ^ and 63^ + 

4.6 General theory and cumulant expansion from the gener- 
aUzed correlation matrices 

4.6.1 Strategies 

Let us consider the case where the assets are distributed according to the arbitrary 
joint distribution 

P{xi,X2, ...,Xi, ...,XN,t/x[,xl, ...,x'^i, ...,x%,0) , (106) 

which is in general non-Gaussian and with arbitrary inter-asset correlations. A nat- 
ural strategy to determine a portfolio is to calculate the weights pk of the assets in 
the portfolio, A; = 1 to A^, which minimize the risk of losses larger than a chosen VaR 
A, this in the presence of other constraints for instance on the return : 



a j:^ Ps{5S)d5S - a, EUpA^xj) - «2 Ef=iPi 



(107) 



dpk 

for k = 1 to N, where ai and 02 are Lagrange parameters. A becomes a parameter 
that quantifies the risk aversion of the investor. The residual probability 'Pmm.(A) 
obtained after the optimization gives the mean frequency of the occurrence of losses 
equal or larger than A. This approach requires a three-dimensional representation 
along the following axis : 

1. the VaR A controlling the acceptable level of loss, 

2. the loss probability V{X) at this level, 

3. the expected return. 

It generalizes the usual two-dimensional representation of the mean-variance 
Markovitz diagram. This is the price to pay when the distributions of asset price 
variations are not quantified by a single risk parameter, the variance. Note that 
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there is another situation where the risks can be captured by a single parameter, 
i.e. the case of power law distributions quantified by the scale parameter. This case 
has been discussed previously [|19| and the general solution for the portfolio opti- 



mization has been given [jT9[ as a rather straightforward generalization of the usual 
mean-variance approach. 

In the same spirit as for the minimization of the probability of losses larger than 
VaR, let us mention the strategy consisting in maximizing the probability to obtain 
a minimum return J^^.^ Ps{SS)d6S [^. Generalizing further, we can propose to 
minimize the weight JZ^ Ps{SS)d6S of the losses in the presence of the constraint 
that the probability of a minimum gain is fixed, thus generalizing in the probability 
space the mean- variance concept. This view point is stimulated by the observation 
that the returns and losses aggregated over long period of times are often mainly 
caused by large amplitude price variations that occurred over a very tiny fraction of 
the total time of the investment. For instance, for the US SSzPbOO index, from 1983 
to 1992, 80% of the total return stems from 1.6% of the trading time. This leads to 



optimization problems similar to ( |107|) . 

The most general approach consists in first determining the complete distribution 
P{6S{t)) of the price variations of the portfolio as a function of the distributions of 
the underlying assets. Once characterized, one knows fully the impact of the asset 
weights on the portfolio. The optimization of the portfolio can then proceed using 
suitable risk measures. 

4.6.2 Distribution of the price variations of the portfolio 

P{6S(t)) is formally given by 

N 

P{SSit)) = J dX-P{X-)6{S -Y^pa^x^t)) , (108) 

a=l 

where we use again the matrice notation 

Pi^X") = P{6xi, 6x2, •-, Sx,, ...Sxn) (109) 

and / dX"- stands for integrals over the price variations of the assets. The 
probability being positive, we can parametrize it without loss of generality 

P(X") = exp (^—X'^'^V-^X'' - ViX"")^ , (110) 
31 



where we distinguish a Gaussian part exp[—^X"-^V^^X°-] and a residual term 
exp[— V(X")] representing the non-Gaussian part. It may be dominant over the 
Gaussian part. The expression (|110|) defines the Gaussian correlation matrix V and 
the non-Gaussian contribution V, which contains all terms with power strickly larger 
than 3 in X". With (|108| ), we obtain the expression of the Fourier transform of 
P{6S) : 

P{k) = J dX" exp(^-^X'''^V-^X'' - V(X") + H^X"^ , (111) 

where H = ikp and p is as before the unit column vector of the asset weights in the 
portfolio. 

The most general and powerful technique to determine P{6S) from the exact 
expression (|111|) consists in using the "technology" of functional integrals The 
basic idea is to reduce the evaluation of the integrals to that of Gaussian integrals. 
The key technical remark is 

d 



OH' 



J rfX'*exp(^-^X"^\/-^X'' - V(X'') + H^X" 



J dX^X" exp (^—X^^V'^X" - ViX") + H^X"^ . (112) 

Expanding the exponentiel as a formal series, we use the identity ( |112| ) and then 
resum the series to obtain formally 

P{k) = exp(-V(^)) / dX-exp(^-^X-^V-'X^ + H^X^^ . (113) 

The notation V(g^) means that each component X'^ is replaced by the operator -^p. 
By this trick, we have transformed a non-Gaussian integral into an operator applied 
on a Gaussian integral that can be calculated explicitely by using (pTF). This yields 



P{k) = exp(f: V(^))exp(ii7'^^\/i7'^). (114) 

Notice that we retrieve the Gaussian case for V = 0, i.e. the Fourier transform of 
(|8^) by replacing H by ikp. By the normalization of probabilities, -P(O) = 1. 
One can show |^ that P{k) given by ( |1 14| ) can be written as 

P{k) = e^(^) , (115) 
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where W{H) presents a systematic expansion 

oo 1 Af TV 

wiH) = E -y E - E H,,...H,^G'^\n, J.) . (116) 

n=l ji=l j„=l 

Hj for j = 1,...,N is one of the components of the unit column vector ikp. The 
functions G^J^\ji, ...,jn) can be expressed exphcitely in terms of V The usefulness 
of this formulation is that each H brings in a power of k : 

P{k) = exp(E ^ E - E Pn-P^.G'pin, j„) ) . (117) 



This expansion defines the cumulants c„ = (—i)'^-^P(k)\k = of the distribution 



[jciiiniuii uciiiico Liic uuiiiuiciiito = — t 

P{6S) : 

N N 

Cn=E-EP.l-P.nG«(jl,-,Jn). (118) 

Jl=l Jn = l 

Notice that the correlations between the different assets are taken into account in 
the functions G^*-*(ji, jj). They play the role of generalized correlation functions 
which quantify the pairwise, triplets, etc, correlations (a multiplet can contain the 
same asset several times, thus capturing the effect of self-correlations). 
One can also use another systematic expansion 

Pik) = E E - E Pn-H.G^Hju-,Jn) , (119) 

n=l ^- ii=l i„=l 

where the functions G'^*^(ji, j„) can be expressed in terms of the G^*^(ji, j„). 
The functions G^'^\ji, ■..,jn) are analogous to the moments which can be related to 
G^\ji, ■■.,jn) which are analogous to the cumulants. 

Let us persue this analysis and consider the symmetric case where the first term 
in the expansion of V{X) is quartic : 

N N N N 

v(x) = E E E E vabcdX^x'x'^x^+ 

a=l 6=1 c=l d=l 
N N N N N N 

E E E E E E va.cdefX^x'x'^x'x^xf + .... (120) 

a=l 6=1 c=l d=l e=l /=1 

Keeping for the time being only the quartic terms proportional to Vabcd, we obtain 



a /I ^ N N N N 

V(— -) exp -H-'^VH- )=T.T.J2J2 ^-bcd ( VabV^d + VaMd + VadVhc+ 

a=16=lc=ld=l 
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VabScSd + VacSbSd + VadSbSc + VbcSaSd + VbdSaSc + VcdSaSc + SaSbScSdj , (121) 

where Sa = ikYlk=i^akPk- We thus see all possible term combinations. In order to 
retrieve the fourth order cumulant, we notice that 

VabcdiVabVcd + VacVbd + VadVbc+ 



exp 

VabScSd + VacSbSd + VadSbSc + VbcSaSd + VbdSaSc + VcdSaSc+ 

(1 - ^(KbKd + VacVbd + VadVbc))SaSbScS}j\^ (122) 

retrieves ( |121[ ) by an expansion to the quartic order. Here, we simply use the fact 
that exp(aa; + hx^) = 1 + ax + {^a? + h)x'^ + .... We thus obtain 

N N N N 

Ci = Y.Hmi ^tU^ /, m)pjPkPiPm , (123) 

j=l k=l 1=1 m=l 

with 

N N N N , . 

Gtij, kJ,m)=2AY,Y.T.T. ^abcd 1 - -iYahVcd + VacVbd + KdHc) ) VajVbkVdVd^ . 
a=lb=lc=ld=l V ^ / 

(124) 

The higher order terms like VabcdefX°-X^X'^X'^X'^Xf also contribute to the fourth 
order cumulant as seen by generalizing (|121|) . In fact, C4 receives contributions from 
all higher order terms. They are weighted by the coefficients Vabcdef... which in general 
decrease rather fast. Diagramatic techniques can then be used to keep track of all 



terms at a given order in the systematic expansion 
4.6.3 Application to the quasi-gaussian case 

The quasi-gaussian case where the distribution P{6S) has the form ( P?] ) implies pre- 
cise constraints on the structure of the cumulants of P{6S) and thus on the correlation 
functions between the assets. Indeed, from the expression (|72|) giving P{6S) up to 
the first correction in terms of the kurtosis, we see that P{6S) is uniquely a function 
of p^Typ (where P^VP is denoted V in (0)) only if the cumulant of order 4 is pro- 
portional to with a coefficient of proportionality which is a pure number. As a 
consequence, the kurtosis must be a number independent of the asset weights in the 
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portfolio. For this to be true, the cumulant C4 given by (|123|) must factorize and is 
proportional to the square of the cumulant C2 : 

N N 

C2^J2EGlU,k)pjPk , (125) 
i=i fc=i 

where 

N N 

GlU,k) = Y.T.VajV,,. (126) 

a=l 6=1 

The identification term by term yields 

Gt{j,k,l,m)=wGl{j,k)Gl{l,m) , (127) 



where w is arbitrary. This expression (|127| ) together with ( |124| ) and ( |126|) determines 



the particular structure of the four-asset correlations Vabcd in the quasi-gaussian case 
(1971). 



5 Conclusion 

We have tried to demonstrate the analogies between the quantification of risks in 
finance and insurance and the optimization of portfolios on one hand and statistical 
physics concepts and methods on the other hands. The main message similar to that 



given long ago for random physical systems |22| is that a suitable risk assessment 
requires the study of the full distributions of price variations in contrast to the more 
standard variance approach. We have also shown how tools developed in statistical 
physics to address large fluctuations can be used in the optimization of portfolios. 
We thus hope to foster the interest of the physical community in these fascinating 
problems. 
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FIGURE CAPTIONS 



Figure 1: Return r and variance w as a function of / for p = 1/2 and A = 2.1. 

Figure 2: Return r as a function of v for the same values p = 1/2 and A = 2.1 as 
in figure 1. 

Figure 3a: The characteristic time t*{f) defined by (|^) as a function of / for r > 0. 

Figure 3b: The Sharpe parameter Sharpei = v^[t*(/)]^^/^ = defined for a 
unit time step function of /. 

Figure 4: Dependence of the average wealth (5'(1)) and of the typical wealth 
Spp{l) as a function of / after one time step, for the case p = 1/2 and A = 2.1. 

Figure 5a: The Cramer function s{x) given by (^) and its parabolic approxima- 
tion as a function of x for p = 0.5. 

Figure 5b: The Cramer function s{x) given by (pQ]) and its parabolic approxima- 
tion as a function of x for p = 0.95. 

Figure 6: The Cramer function s{y = a6S) = log{y) + 1 — y and its parabolic 
approximation Sg{y) = —\{y — 1)^- 
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